Reviews: Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Neural Information Processing Systems

The paper proposes to use episodic backward updates to improve data efficiency in RL tasks; furthermore, the authors introduce a soft relaxation of this update to combat the overestimation that typically arises when backward updates are combined with neural network models. Overall, the paper is very clearly written. My main concerns are with the experimental details and with the literature review; when the existing literature is taken into account, the novelty of the work is quite limited. The idea of backward updates is quite old and goes back at least to the 1993 paper "Prioritized Sweeping" by Moore and Atkeson, which in fact demonstrates a method very similar to what the authors propose and which the authors fail to cite. Furthermore, quite a few recent papers operate in a similar space of ideas, using a backward view in ways similar to the authors', e.g.: Fast deep reinforcement learning using online adjustments from the past, https://arxiv.org/abs/1810.08163


Reviews: Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Neural Information Processing Systems

All reviewers recommend accepting the paper. The authors' response addressed most of the reviewers' concerns. While the AC recommends accepting the paper, the AC encourages the authors to consider the comments of reviewer 1. Changing only the backup mechanism while keeping all other hyperparameters fixed as in the Nature DQN model is indeed a good experimental setup. However, the optimal operating mode for different models might differ (even when sharing architectures and training protocols): for instance, we could 'afford' a larger learning rate if we have a better backup mechanism.


Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Lee, Su Young, Choi, Sungik, Chung, Sae-Young

Neural Information Processing Systems

We propose Episodic Backward Update (EBU) – a novel deep reinforcement learning algorithm with direct value propagation. Our computationally efficient recursive algorithm allows sparse and delayed rewards to propagate directly through all transitions of the sampled episode. We theoretically prove the convergence of the EBU method and experimentally demonstrate its performance in both deterministic and stochastic environments. In particular, on 49 games of the Atari 2600 domain, EBU achieves the same mean and median human-normalized performance as DQN using only 5% and 10% of the samples, respectively.


Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Lee, Su Young, Choi, Sungik, Chung, Sae-Young

arXiv.org Machine Learning

We propose Episodic Backward Update – a new algorithm that boosts the performance of a deep reinforcement learning agent through fast reward propagation. In contrast to the conventional use of experience replay with uniform random sampling, our agent samples a whole episode and successively propagates the value of a state to its previous states. Our computationally efficient recursive algorithm allows sparse and delayed rewards to propagate efficiently through all transitions of a sampled episode. We evaluate our algorithm on a 2D MNIST Maze environment and on 49 games of the Atari 2600 environment, and show that our method improves sample efficiency at a competitive computational cost.
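
The core mechanism described in the abstract – sampling a whole episode and sweeping its transitions backward so a terminal reward reaches early states in a single pass, with a soft blend against the current estimate to limit overestimation – can be illustrated with a minimal tabular sketch. This is an assumption-laden simplification, not the paper's exact EBU target: the blending coefficient `beta` and the way the propagated value is mixed with the bootstrapped estimate are illustrative stand-ins for the diffusion mechanism the paper defines for the function-approximation setting.

```python
import numpy as np

def episodic_backward_update(Q, episode, gamma=0.99, alpha=0.5, beta=0.5):
    """One backward sweep over a sampled episode (tabular sketch).

    Q        : numpy array of shape (n_states, n_actions)
    episode  : list of (state, action, reward, next_state) tuples in time
               order; next_state is None for the terminal transition.
    beta     : illustrative diffusion factor blending the value propagated
               backward with the table's own bootstrapped estimate
               (an assumption, not the paper's exact formulation).
    """
    propagated = 0.0
    # Iterate from the end of the episode so each update can reuse the
    # target just computed for its successor state.
    for t, (s, a, r, s_next) in enumerate(reversed(episode)):
        bootstrap = np.max(Q[s_next]) if s_next is not None else 0.0
        if t == 0:
            target = r + gamma * bootstrap
        else:
            # Soft relaxation: mix the freshly propagated value with the
            # current estimate instead of trusting the backward pass fully.
            target = r + gamma * (beta * propagated + (1 - beta) * bootstrap)
        Q[s, a] += alpha * (target - Q[s, a])
        propagated = target
    return Q
```

On a three-state chain with a single terminal reward, one such sweep already assigns nonzero value to the initial state, whereas uniform one-step replay would need several passes – which is the sample-efficiency intuition the abstract appeals to.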